Discriminative models for robust image classification

نویسنده

  • Umamahesh Srinivas
چکیده

A variety of real-world tasks involve the classification of images into pre-determined categories. Designing image classification algorithms that exhibit robustness to acquisition noise and image distortions, particularly when the available training data are insufficient to learn accurate models, is a significant challenge. This dissertation explores the development of discriminative models for robust image classification that exploit underlying signal structure, via probabilistic graphical models and sparse signal representations. Probabilistic graphical models are widely used in many applications to approximate high-dimensional data in a reduced complexity set-up. Learning graphical structures to approximate probability distributions is an area of active research. Recent work has focused on learning graphs in a discriminative manner with the goal of minimizing classification error. In the first part of the dissertation, we develop a discriminative learning framework that exploits the complementary yet correlated information offered by multiple representations (or projections) of a given signal/image. Specifically, we propose a discriminative tree-based scheme for feature fusion by explicitly learning the conditional correlations among such multiple projections in an iterative manner. Experiments reveal the robustness of the resulting graphical model classifier to training insufficiency. The next part of this dissertation leverages the discriminative power of sparse signal representations. The value of parsimony in signal representation has been recognized for a long time, most recently in the emergence of compressive sensing. A recent significant contribution to image classification has incorporated the analytical underpinnings of compressive sensing for classification tasks via class-specific dictionaries. In continuation of our theme of exploiting information from multiple signal representations, we propose a discriminative sparsity model for image classification applicable to a general multi-sensor fusion scenario. As a specific instance, we develop a color image classification framework that combines the complementary merits of the the red, green and blue channels of color images. Here signal structure manifests itself in the form of block-sparse coefficient matrices, leading to the formulation and solution of new optimization problems. As a logical consummation of these ideas, we explore the possibility of learning discriminative graphical models on sparse signal representations. Our efforts are inspired by

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Robust Method for E-Maximization and Hierarchical Clustering of Image Classification

We developed a new semi-supervised EM-like algorithm that is given the set of objects present in eachtraining image, but does not know which regions correspond to which objects. We have tested thealgorithm on a dataset of 860 hand-labeled color images using only color and texture features, and theresults show that our EM variant is able to break the symmetry in the initial solution. We compared...

متن کامل

Palarimetric Synthetic Aperture Radar Image Classification using Bag of Visual Words Algorithm

Land cover is defined as the physical material of the surface of the earth, including different vegetation covers, bare soil, water surface, various urban areas, etc. Land cover and its changes are very important and influential on the Earth and life of living organisms, especially human beings. Land cover change monitoring is important for protecting the ecosystem, forests, farmland, open spac...

متن کامل

Research Statement Discriminative Graphical Models for Image Classification

A large number of real-world classification problems in the domain of text, speech, image, biology, etc. involve labeling of data items that have multiple components, each requiring a classification label. These problems are challenging because the interactions among the components can be rich and complex. I am interested in sound theoretical formulations that can model such complex interaction...

متن کامل

Automatic road crack detection and classification using image processing techniques, machine learning and integrated models in urban areas: A novel image binarization technique

The quality of the road pavement has always been one of the major concerns for governments around the world. Cracks in the asphalt are one of the most common road tensions that generally threaten the safety of roads and highways. In recent years, automated inspection methods such as image and video processing have been considered due to the high cost and error of manual metho...

متن کامل

BrainParser: Deformable Templates Guided Discriminative Models for Robust 3D Brain MRI Segmentation

Automatically segmenting anatomical structures from 3D brain MRI images is an important task in neuroimaging. One major challenge is to design and learn effective image models accounting for the large variability in anatomy and data acquisition protocols. A deformable template is a type of generative model that attempts to explicitly match an input image with a template (atlas), and thus, they ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • CoRR

دوره abs/1603.02736  شماره 

صفحات  -

تاریخ انتشار 2016